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ABSTRACT 

It  is  natural  to  associate  optimal  estimation  and  opti- 
mal statistical  testing.   In  this  paper  continuous  function 
estimation  of  the  cumulative  distribution  is  used  to  define 
two  test  statistics  that  compete  with  the  Kolmogorov  D 
statistic.   The  first  statistic,  C  ,  is  attributed  to  Pyke 
and  the  second,  R^ ,  is  obtained  by  polygonalizing  the  sample 
distribution  function.   It  is  known  that  both  are  asymp- 
totically equivalent  to  the  Kolmogorov  statistic.   Using 
the  methods  of  J.  Durbin,  the  small  sample  distributions 
are  tabled  as  well  as  the  critical  points  for  significance 
levels  of  .20,  .10,  .05,  .025,  and  .01.   It  is  shown  that 
R^  is  stochastically  smaller  than  DN ,  and  it  appears  that 

CX1  is  also  smaller  than  D„T. 

N  N 
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I.   INTRODUCTION 

The  popular  method  for  testing  the  simple  hypothesis 

H  :  F  =  F  ,  where  F  is  the  unknown  cumulative  distribution 
o        o 

of  the  population  sampled  and  F   is  the  proposed  continuous 
distribution  of  the  sample,  is  the  Kolmogorov  DN  statistic 
[Ref.  4].   The  asymptotic  distribution  of  D  was  developed 
by  Kolmogorov  [Ref.  2]  and  tabled  by  Smirnov  [Ref.  6];  the 
small  sample  distribution  was  tabled  by  Massey  [Refs.  3  and 
4]  and  has  been  treated  recently  (among  others)  by  Durbin 
[Ref.  1].   A  modification  of  the  Kolmogorov  test  has  been 
proposed  by  Pyke  [Ref.  5]  and  the  distribution  of  this  test, 
called  C  ,  can  be  obtained  from  Durbin  [Ref.  1]. 

The  Kolmogorov  D   statistic  is  associated  with  a  step 
function  estimation  of  F  in  the  computation  of  the  test 
statistic.   This  function,  however,  is  from  a  vacuous  class 
under  the  assumptions  of  the  method,  since  a  priorily  F  is 
assumed  to  be  continuous.   Of  course  the  estimator  converges 
to  the  continuous  function  F  as  the  sample  size  becomes 
large  without  limit. 

The  question  of  estimating  the  distribution  F  is  sep- 
arate from  testing  H  .   However,  it  is  natural  to  associate 

^   o 

the  two  and  to  expect  optimal  estimation  to  yield  optimal 
testing  methods.   Two  of  the  statistics  considered  in  this 
paper  may  be  viewed  as  being  based  upon  continuous  esti- 
mators of  F.   It  is  shown  that  each  is  asymptotically 

equivalent  to  D„T. 

^  N 


The  statistics  investigated  are  presented  in  section 
two.   The  asymptotic  nature  of  each  and  the  relation  between 
each  is  shown  in  section  three.   Section  four  presents  the 
computational  method  used  in  tabling  the  distribution  of  the 
statistics  introduced  in  section  two.   Final  comparisons 
are  covered  in  section  four.   The  tables  are  in  the 
appendices . 


II.   BASIC  DEFINITIONS,  KOLMOGQROV  STATISTIC,  PYKE 
STATISTIC,  AND  A  PROPOSED  STATISTIC 

Let  Y,  ,  Y»,.  .  .  ,  Y   be  a  random  sample  from  a  continuous 
population  with  cumulative  distribution  function  F(y).   The 
methods  presented  below  are  for  testing  the  simple  hypothesis 


H   :  F  =  F 
o         o 


The  procedures  discussed  are  based  on  the  order  statistics 
of  the  sample  where 


12  N 


are  the  ordered  sample  points,  and  the  sample  cumulative 
distribution  is 


X  <  X 


FN(x)  =   j/N        X.  <  X  <  Xj  +  1 


X  >  xN   . 


The  Kolmogorov  D   statistic  for  testing  H   is 


DN  =  sup  |FN(x)  -  FQ(x) 


and  has  the  property  of  being  distribution  free  under  H 
[Ref .  4] .   This  property  allows  for  computation  of  the  cum- 
ulative distribution  of  D   to  be  done  with  F   assumed  to  be 

N  o 

uniform  (0,1)  without  loss  of  generality. 


max 


Lemma  1  :  DN  ^^N  {  '  X  j  "  (J/N>UXj  -  ( j  - 1 ) /N  |  } 


Proof:  After  transformation  under  H   the  statistic 

o 


becomes 


dm  =  n.U?i  lx  "  F^(x) 
N    0<X<1  '      N 


The  assertion  is  obvious  from  Figure  one. 

A  modification  of  the  Kolmogorov  statistic  is  attributed 
to  Pyke  [Ref.  5]  where  the  test  statistic  is 

cm  =  -i™3^  lx-  "  J/(n+D  |   . 

Implicit  in  this  method  is  the  idea  that  the  j    order 
statistic  estimates  the  ( ( j+1) /N) (100)  percentile  of  F. 
Such  estimators  are  unbiased  when  F   is  uniform  (0/1). 

The  statistic  C   may  be  viewed  as  being  based  on  a 
continuous  function  estimator  of  F  constructed  by  connect- 
ing the  points  (0,0);  (X1,1/(N+1))  ;.  .  .;  (X^N/ (N+l)  )  ; 
(1,1)  with  straight  line  segments.   This  is  apparent  since 
any  point  on  the  line  segment  L  from  (X.,  j/(N+l))  to 
(X .  , ,  (j+l)/(n+l))  is  a  convex  combination  of  the  endpoints 
and  hence  the  maximum  separation  between  points  on  the  line 
Y  =X  and  L  must  occur  at  one  of  the  endpoints  of  L. 

It  is  of  interest  to  consider  a  sample  cumulative  dis- 
tribution formed  by  connecting  the  points  (0,0); 
(X, ,1/N) ; . . . ;  (X  ,1)  with  straight  line  segments  and  the 
resultant  statistic 
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N 
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$ 


(j-l)/N  -   X. 


-V-+ 


X 


j-1 
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Figure    1.      Lemma    1 
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*N  =  S-P  ISN(X»  "  Fo(x)l 


where  S  (x)  is  the  sample  cumulative  distribution  modified 
as  described  above.   Properties  of  R^  are  presented  in 
section  three  and  its  distribution  for  finite  N  is  tabled 
in  Appendix  B. 
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III.   ASYMPTOTIC  EQUIVALENCE  OF  THE  TEST  STATISTICS 

It  is  well  known  that 

Lim   P[/~  DN>Z]  =  2  Z  (-1)    exp  {-2r2Z2} 
N->°°  r=l 

[Ref.  2]  and  it  is  shown  below  that  both/N  R-.  and  /n  C 
have  the  same  limiting  distribution. 

Lemma  2:  For  all  N  and  all  F  1L  is  stochastically 
smaller  than  D  ,  i.e.,  R^   5.D.,. 

Proof:  By  lemma  one 

DN  =  ^  {lXj  "  j/Nl  '  lXj  "  ^"1)/N|  } 
and  it  follows  from  the  definition  that 


max 


P^  =  "J*  {|X   -  j/N|}   . 


The  result  is  immediately  obvious.   The  latter  is  maximized 
over  a  subset  of  those  for  which  the  former  is  maximized. 
Lemma  3:  For  all  e  >   0 


Lim   P[/N|CN  -  Rj  <  e]  =  1 
N->°° 


Proof:  Consider 


CM  =  maX  |X.  -  j/(N+l) | 
N     j     j 


max  |X.  -  j/N  +  j/N  -  j/(N+l)| 
j     -1 


<  maX  |X.  -  j/N  |  +  1/(N+1)  =  P^  +  1/(N+1) 


13 


Thus   CN  -  Rj^   <   1/(N+1) 


Now  consider 


RN  =  Y  |X.  -  j/N 


mjX  |Xj  -  j/(N+l)  +  j/.(N+l)  -'j/N 


<  mjX  |Xj  -  j/(N+l)|  +  1/(N+1)  =  CN  +  1/(N+1)    . 


Combining  the  two  it  is  seen  that 


/N  |RN  -  CN|  <  /N/(N+1) 


Thus 


p[^  I*n  ~   cnI  <  £]  "  1 


for  all  N  so  large  that 
v/N  /(N+l)  <  e 
Lemma  4:  For  all  e  >  0  it  will  be  shown  that 


Lim   P  y^  D„  -  S,,   <  e]  =  1 
1  N     N ' 
N+00 


Proof:  Let  Q__  =  ,  ™a*   |x.  -  ( j  - 1 )  /N  I   and  note  that 


QN  =l<j<N  lXj  "  i/N  +3/N  -  (j-D/N|  i  ^  +  1/N 


Thus 


/S    |QN  -  Rjjl  <  1/^S"    . 
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Also 


max 


dn  =  i<j<n  {lxj  "  ^Nl  •  lxj  "  <J-U/»I> 

=  max  {P^  ,  Q~} 

Thus  |DN  -  Rj  =  |max  {0,(q"  -  R^) } \    <  | Q~  -  R^    and 
it  follows  that 


Lim  P[/N  |  d  -  Rjj  I  <  e  ]  =  1 

for  all  N  so  large  that  /N  >  1/e  . 

r-  r-1        2  2 

Lemma  5:   Lim   P [ /n  P^  >  Z  ]  =  2  I   (-1)   exp  {2r  Z  } 

N+oo  r=l 


Proof:       P[/NDN>    Z]    =    P  [ /N    P^    +    /N     (Dn    -    P^)     >    Z] 


Lim      P[/NRN+    /N     (D-    P^)     >    Z]    =    Lim      P  [ /N    R>  Z] 
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IV.   COMPUTATIONAL  METHOD 

The  method  used  to  evaluate  the  cumulative  distribution 
function  of  the  statistics  D„,  C„ ,  and  R^  is  from  the  work 
of  Durbin  [Ref .  1] .   A  sketch  of  his  results  and  the  modifi- 
cations necessary  for  the  statistics  under  consideration  are 
presented  in  this  section. 

"Suppose  that  0  <_  x,  <.  x„  £  .  .  .  <  x   <  1  is  an  ordered 
random  sample  of  independent  observations  from  a  uniform 
(0/1)  distribution.   The  sample  distribution  function  is...", 
F>T  (x)  as  defined  in  section  two.   "Let  S  denote  the  sample 
path  of  F  (x)  as  x  moves  from  0  to  1.   In  this  paper  we  con- 
sider the  probability  p  (a,b,c)  that  S  lies  entirely  in  the 
region  R  between  the  lines  ny  =  a+(n+c)x  and  ny  =  -b+(n+c)x, 
(a>0,  b>0,  a+b>0,  and  a-c>0)." 

Define : 


H 


1-6 

1 

0 

(l-62)/2! 

1 

1 

(l-63)/3! 

1/2! 

1 

p   p 

1-6  -@+h 

P! 


l-@ 


P-l 


(P-D  ! 


0 
0 
0 


2! 


0 
0 
0 


i-e 


Durbin,  J.,  "The  Probability  That  the  Sample  Distri- 
bution Function  Lies  Between  Two  Parallel  Straight  Lines." 
The  Annals  of  Mathematical  Statistics,  V.  39,  p.  398, 
April  1968. 
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[z]  =  the    greatest    integer   in    z. 

1-6  =  (b-c)    -    [b-c] 

l-@  =  (a-c)    -    [a-c] 

P  =  [b-c]    +    [a+c]    +    1 

h  e  0    if    6  +  @    £  1    and  h    =    (6  +  @    -    1)P    if    6  +  @    >    1 

q        =    the    ([b-c   +    l],[b+l])    element   of    the   matrix 
H[N+c] 

N  » 
Thus    p    (a,b,c)    =   ^-tr  q         [Ref.    1]  . 

n  (N+c)w 

Lemma   6:    P [D      <   k/N]    =   p    (k,k,0)  [Ref.l]. 

Lemma    7:    P  [C      £.k/(N+l)]    =    p     (k,k+l,l)  [Ref.     1]  . 

Lemma    8:    P [R^   <   k/N]    =   p    (k,k+l,0) 

Proof:   Assume  the  sample  cumulative  distribution, 
S  (x) ,  has  stayed  within  the  bounds  shown  in  Figure  two 
until  it  arrives  at  the  point  (X.,j/N).   The  next  observa- 
tion, X.,,,  must  occur  before  X.,,  for  non-rejection  by 
j+1  3  +  1  J         -1 

the  D   statistic.   For  R   to  accept  the  sample,  the  next 


observation  must  occur  before  X..,.   Since  all  steps  are  of 

1  +  1  ^ 


,R 

height  1/N  the  point  (X.  ,,  (j+l)/N)  is  in  the  region  R  if 


the  lower  band  of  the  equivalent  D   statistic  is  shifted 
downward  by  1/N.   The  resultant  band  is  ny  =  -(b+1)  +  nx. 
If  the  point  (X .  , ,  (j+l)/N)  were  to  fall  in  the  region  W 
for  either  D   or  R   the  hypothesis,  H  ,  would  be  rejected. 
Hence  the  upper  bound  for  the  region  R  is  unchanged. 

The  computer  program  used  to  table  the  distribution  of 
C   and  R  is  presented  in  the  rear  of  this  thesis.   As 
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NY    =    A    +    NX 


Figure    2.      Lemma    8 


18 


written  the  parameter  P  is  restricted  to  be  less  than  or 
equal  to  forty.   The  subroutine  DURMAT  is  used  to  keep  the 
matrix  products  within  the  range  of  the  IBM  360/67  used. 
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V.   COMPARISON  OF  STATISTICS,  REMARKS,  AND  CONCLUSIONS 

The  distribution  of  the  statistics  C   and  K  are  pre- 
sented for  the  first  time  in  Appendix  A  and  B.   Some 
critical  points  for  various  levels  of  significance  are 
presented  in  Appendix  C  for  D  ,  C  ,  and  R^.   It  is  seen 
for  sample  sizes  greater  than  sixty  all  three  statistics 
have  essentially  the  same  critical  points  for  alpha  = 
.20,  .10,  .05,  .025,  and  .01. 

From  lemma  two  it  is  known  that  R^  is  stochastically 

smaller  than  D_T  and  Appendix  C  indicates  that  CAT  is  also 

N       ^^  N 

stochastically  smaller  than  DN  for  the  alpha  values  shown 

under  H  .   Such  results  are  important  in  developing  smaller 

confidence  bands  for  distributions  [Ref .  7] . 

It  has  been  noted  that  the  asymptotic  formula  of  Kol- 

mogorov  is  correct  (to  three  decimal  places)  if  N>60  for  all 

three  statistics.   The  tables  indicate  that  the  limiting 

distribution  is  reached  by  D   earlier  (i.e.,  N>_30)  for  the 

alpha  values  listed.   It  can  be  seen  in  Appendix  C  that  for 

alpha  >_.  05  the  distribution  of  DN  approaches  the  asymtote 

from  above  whereas  both  C,T  and  R.,  approach  it  from  below  for 

N       N  ^^ 

all  listed  values  of  alpha.   Also,  C   and  R^  are  in  essen- 
tial agreement  for  sample  size  greater  than  fifteen. 

Other  statistics  can  be  investigated  empirically  and 
analytically  using  Durbin's  paper.   An  interesting  area  for 
investigation  would  be  to  relate  the  power  functions  of  the 
test  statistics  to  the  parameters  (a,b,c). 
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APPENDIX  A, 


P(  C   <  K/(N+1>  > 

N 


N  = 


1C 


15 


20 


25 


30 


35 


K 
I 

.27006 

.03550 

.00417 

.00047 

.00005 

.0 

.0 

2 

.83719 

.46822 

.23292 

.11016 

.05062 

.02283 

.01016 

3 

.98666 

.83796 

.63607 

.45822 

.32060 

.22015 

.14921 

4 

.99999 

.96862 

.87604 

.75491 

.63168 

.51860 

.42018 

5 

1.0 

.99626 

.96838 

.91124 

.83568 

.75243 

.66858 

6 

.99974 

.99403 

.97403 

.93814 

.89000 

.83417 

7 

.99998 

.99917 

.99390 

.98034 

.95747 

.92640 

a 

.99999 

.99991 

.99885 

.99474 

.98570 

.97099 

9 

.99999 

.99998 

.99981 

.99881 

.99582 

.98984 

10 

1.0 

.99998 

.99995 

.99975 

.99892 

.99683 

11 

.99973 

.99910 

12 

.99991 

.99974 

N 

=   40 

*5 

50 

55 

60 

65 

70 

K 
3 

.10016 

.06674 

.04420 

.02914 

.01913 

.01252 

.00817 

4 

.33714 

.26850 

.21257 

.16749 

.  13154 

.10281 

.08019 

5 

.58829 

.51376 

.44601 

.38532 

.33157 

.28436 

.24317 

6 

.77448 

.71374 

.65384 

.59608 

.54125 

.48980 

.44197 

7 

.88906 

.8^73  9 

.80305 

.75745 

.71160 

.66627 

.62203 

8 

.95C78 

.92581 

.89700 

.86533 

.83164 

.79665 

.76098 

9 

.98030 

.967C3 

.95073 

.93062 

.90829 

.88387 

.85782 

10 

.99288 

.98666 

.97796 

.96680 

.95332 

.93771 

.92024 

11 

.99766 

.98666 

.99097 

.98524 

.97779 

.96860 

.95777 

12 

.99929 

•995C5 

.99657 

.99389 

.99011 

.98512 

.97888 

13 

.99762 

.99587 

.99335 

.99001 

14 

.99911 

.99835 

.99718 

.99551 

N 

75 

8C 

35 

90 

95 

100 

K 
3 

.00532 

.00345 

.00224 

.00144 

4 

•C6238 

.04843 

.03753 

.02903 

5 

.20743 

.17657 

.15001 

.12723 

6 

.39782 

.35731 

.32030 

.28665 

7 

.57929 

.53832 

.49928 

.46229 

.42740 

.39460 

8 

.72512 

.68946 

.65430 

.61988 

.58639 

.55395 

9 

.83060 

.8C257 

.77402 

.74524 

.71647 

.68787 

10 

.90121 

.88087 

.85947 

.83725 

.81443 

.79119 

11 

.94542 

.93169 

.91671 

.90068 

.88374 

.86605 

12 

.97141 

.96275 

.95292 

.94204 

.93020 

.91749 

13 

.98579 

.98067 

.97460 

.96764 

.95982 

14 

.99328 

.99C44 

.98691 

.98271 

.97781 
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APPENDIX  B. 


P(  R   <  K/N  ) 

N 


N  = 


10 


15 


20 


25 


30 


K 

1 

.27854 

.03814 

.00455 

.00051 

.00006 

2 

.83040 

.47338 

.23874 

.11374 

.05253 

.02377 

3 

.98464 

.83505 

.63716 

.46151 

.32425 

.22334 

4 

.99970 

.96661 

.87390 

.75422 

.63247 

.52053 

5 

1.0 

.99582 

.96699 

.90962 

.83454 

.75201 

6 

.99970 

.99358 

.97307 

.93696 

.88891 

7 

.99998 

.99909 

.99354 

.97969 

.95663 

e 

.99999 

.99989 

.99876 

.99449 

.98526 

9 

.99999 

.99997 

.99980 

.99873 

.99564 

10 

1.0 

.99998 

.99996 

.99974 

.99997 

N  = 


35 


40 


45 


50 


55 


60 


K 

.83336 

.77405 

.71369 

.65414 

.59666 

.54205 

7 

.92551 

.88924 

.94672 

.80260 

.75721 

.71152 

a 

.97039 

.95010 

.92511 

.89635 

.86476 

.83117 

9 

.98954 

.97988 

.96658 

.94983 

.93006 

.90773 

10 

.99671 

.99268 

.98638 

.97761 

.96639 

.95237 

11 

.99906 

.99758 

.99492 

.99078 

.98499 

.97748 

12 

.99974 

.99925 

.99825 

.99648 

.99376 

.98993 

13 

.99990 

.99976 

.99942 

.99873 

.99756 

.99577 

14 

.99994 

.99990 

.99979 

.99955 

.99908 

.99831 

N  = 


65 


70 


75 


80 


85 


90 


K 

6 

.49076 

.44302 

.39892 

.35854 

.32142 

.28773 

7 

.66645 

.62239 

.57979 

.53893 

.49999 

.46306 

8 

.79632 

.76078 

.72506 

.68953 

.65448 

.62016 

9 

.69337 

.85739 

.93024 

.80229 

.77383 

.74513 

10 

.93725 

.91979 

.90077 

.88047 

.85911 

.83694 

11 

.96828 

.95714 

.94504 

.93130 

.91633 

.90031 

12 

.98492 

.97864 

.97114 

.96244 

.95261 

.94171 

13 

.99324 

.98986 

.98562 

.98046 

.97438 

.96739 

14 

.99713 

.99543 

.99316 

.99031 

.98677 

.98253 

22 


APPENDIX   C.    ASYMPTOTIC    COMPARISON    OF 


D    , 

N 


C    , 

N 


ANO    R 


N 


N  = 


10 


15 


20 


25 


30 


35 


ALPHA 
.20 


10 


C5 


.025 


01 


A 

r> 
c 
p 

.322 
.338 
.292 
.309 

.266 
.2  84 
.246 
.245 

.231 

.239 
.205 

.205 

.210 
.212 
.193 
.193 

.192 
.196 
.178 
.178 

•  181 
.181 
.166 
.166 

A 
D 
C 
P 

.368 
.386 
.348 
.342 

.3C4 
.316 
.284 
.284 

.264 
.273 
.246 
.246 

.238 
.244 
.226 
.226 

•  221 
.223 

.205 
.205 

.206 
.206 
.195 
.195 

A 
D 
C 
P 

.409 
.431 
.386 
.386 

.338 
.352 
.321 
.321 

.294 
.304 
.281 
.281 

.264 
.272 
.251 
.251 

.242 
.248 
.230 
.230 

.230 
.230 
.217 
.217 

A 
D 
C 
P 

.463 
.453 
.425 
.428 

.378 
.364 
.351 
.351 

.327 
.324 
.300 
.300 

.293 

.292 
.275 
.275 

.268 
.268 
.254 
.25^ 

.248 
.248 
.244 
.244 

A 
0 
C 

P 

.516 
.487 
.478 
.481 

.423 

.404 
.390 
.390 

.365 
.352 
.343 
.343 

.326 
.321 

.308 
.308 

.298 
.293 
.283 
.283 

.276 
.276 
.258 
.258 

N  = 


40 


45 


50 


60 


ALPHA 

.20 

A 

.169 

.160 

.157 

.138 

D 

.169 

.160 

.157 

.138 

C 

.156 

.148 

.140 

.134 

P 

.156 

.148 

.140 

.134 

.10 

A 

.193 

.182 

.173 

.158 

D 

.193 

.182 

.173 

.15e 

C 

.179 

.170 

.161 

.155 

P 

.179 

.170 

.161 

.155 

.05 

A 

.216 

.203 

.194 

.176 

D 

.216 

.203 

.194 

.176 

C 

.200 

.190 

.180 

.173 

P 

.200 

.190 

•  180 

.173 

.C25 

A 

.231 

.218 

.207 

D 

.231 

.218 

.207 

C 

.222 

.209 

.200 

P 

.222 

.209 

.200 

.01 

A 

.258 

.243 

D 

.258 

.243 

C 

.249 

.243 

P 

.249 

.243 

A  = 

ASYMPTOTIC  ESTIMATION 

D  = 

D 

N 

C  = 

C 

N 

P  = 

R 

N 

N>60 


1.07/nfN^ 


1.22  /"Yn1 


1.36 


/YT 


.43/vr1 


1.63/lfN1 
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COMPUTER  OUTPUT 


IA  = 

1 

IB  = 

1 

IC  = 

IA  = 

2 

IB  = 

2 

IC  = 

IA  = 

3 

IB  = 

3 

IC  = 

IA  = 

4 

IB  = 

4 

IC  = 

IA  = 

5 

IB= 

5 

IC  = 

IA  = 

6 

IE= 

6 

IC= 

IA  = 

7 

IP  = 

7 

IC  = 

IA  = 

8 

IB= 

8 

IC  = 

IA  = 

9 

IB  = 

9 

IC  = 

SAMPLE  PROGRAM 

THE  D  STATISTIC  FOR  N=10 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 

0  NNN=  10  PROB. 


0.0003628 
0.2512797 
0.7294613 
0.9410061 
0.9922168 
0.9994260 
0.9999743 
0.9999936 
0.9999938 


OUT  OF  RANGE  IA=  10  IB=  10  IC=   0  NNN=  10 
THE  ABOVE  IS  AN  EXAMPLE  OF  ONE  OF  THE  ERROR  MESSAGES 
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COMPUTER  PROGRAM 


1  FCRMATH6X, 'SAMPLE    PROGRAM',/) 

2  FORMAT(10X,'THE    D    STATISTIC    FOR    N-10«t/J 

3  FORMATdX,  "THE    ABOVE    IS    AN    EXAMPLE    OF  •  , 
1«    ONE    OF    THE    ERROR    MESSAGES.1/) 


C 
C 


WPITE(6,1) 
WPITE<6,2) 
IC=0 
NNN=10 

DC  10  IA=1,10 
10  CALL  CURBIN(IA,IA,IC,NNN) 
WPITE(6,3) 
STOP 
END 

SUBROUTINE  DURBI N < I  A  ,  I B , IC ,NNN) 
C 

C  THIS  SUBROUTINE  COMPUTES  THE  PROBABILITY  THAT  A 

C  RANDOM  SAMPLE  OF  SIZE  NNN  DRAWN  FROM  A  UNIFORMS, 1) 

C  RANOOM  VARIABLE  AND  PLOTTED  AS  SHOWN  IN  SECTION 

C  FOUR  OF  THIS  THESIS  WILL  STAY  WITHIN  THE  REGION,  R, 

C  BETWEEN  THE  LINES, 

C  NNN*Y=NNN*IA+(NNN+IC)*X 

C  NNN*Y=  -NNN*IB-MNNN+IC)*X 
C 
C 

C  THE  RESTRICTIONS  ARE 

C  IA,  IB,  ANC  IC  MUST  BE  INTEGER  VARIABLES 

C  IC.GE.O 

C  IA.GT.O 

C  IB.GT.O 

C  IA+IC.GT.O 

C  IB-IC.GT.O 


C 


C 
C 


3  FORMAT!///. 2X. •  OUT  OF  RANGE  IA=«,I3,»  IB=»,1  IC=  », 

113, •  NNN=  SI3,//) 
<♦  FORMAT!//, •  IP.GT.NNN  IA=«,I3,»  IB=«,I3, 

1»  IC=',I3f«  IP=  '.IS,'  NNN=  • , 13) 
5  FORMATS  IA=',I3,!  I  B=  •  ♦  I  3»  •  IC=«  ,  I  3,  •  NNN=»,I3, 

1  •  PROB.  =   »,F<5.8) 

DIMENSION  H(1600)  ,HSTORE ( 1600 ), SAVE1 ( 1600 ) ,SAVE2(1600) 
DATA  H/16C0*0.0/ 
DATA  HSTORE/1600*0.0/ 
DATA  SAVE1/1600*0.0/ 
DATA  SAVE2/1600*0.0/ 
DATA  JPH/O/ 

IFUA.LE.O)  GO  TO  10000 
IF(IB.LE.O)  GO  TO  10000 
IF<  (IA+IO.LE.O)  GO  TC  10000 
IF( (IB-IC).LE.C)  GO  TO  10000 
IF(JPH.GT.O)  GC  TO  IOC 
CALL  DURMAT(HST0RE,40) 
JPH=100 
100  IP=IA+IB+1 

IFUP.GT.4C)  GC  TO  10000 

IJ=IB*IP+IB-IC-H 

IFC I ( IB+IC).GE.NNN).AND.  (  ( I  A+IC ) . GE.NNN ) )  GO  TO  11000 

NSIZE=IP*IP 

JGOT=0 

DC  500  1*1, IP 

JGET=(I-1)*40 

DO  50C  J=1,IP 

JGET=JGET+1 

JG0T=JG0T+1 


25 


500  H( JGOT)=HSTORE( JGET) 

DC  600  J=1,IP 

M=J*IP 
600  H(M)=0.0 

DO  700  J=1,IP 

M=( J-1)*IP+J 
700  SAVE2(M)=1.0 

ICK=NNN+IC 

FACT=REDUCE(NNN,IC,1) 
750  CONTINUE 

DC  8CC  J=1.NSIZE 
90C    SAVE2( J)=SAVE2< J)*FACT 

CALL  GMPRD(SAVE2,H,SAVE1,IP,IP,IP> 

IPATH=1 

ICK=ICK-1 

IF(  ICK.LE.O)  GC  TO  2000 

FACT=PEDUCE(NNN,IC,2) 

DO  900  J=1,NSIZE 
900  SAVEK  J)=SAVE1(  J)*FACT 

CALL  GMPRD(SAVE1,H,SAVE2,IP,IP,IP) 

IFATH=2 

ICK=ICK-1 

IF(  ICK.LE.O)  GO  TO  2000 

FACT=PEDUCE(NNN,IC,2) 

GO    TO    750 
2300    GO    TO    (2010,2100),    IPATH 
2010  WPITE(6,5)IA,IB,IC,NNN,SAVE1(IJ) 

GO    TO    2500 
2  IOC  WPITE(6,5)IA,IB,IC,NNN,SAVE2(IJ) 
2500  DC  2600  I=1,NSIZE 
260C  SAVE2(I)=0.0 

RETURN 
1C0O0  CONTINUE 

WPITE(6,3)IA,IB,IC,NNN 

RETURN 
11000  WRITE<6,4)IA,IB,IC,IP,NNN 

RETURN 

END 

FUNCTION  PEDUCE(NNN,IC,IENTER) 
C 

C  THIS  SUBROUTINE  COMPUTES  THE  TRANSITION  MATRIX 

C  H    OF    SECTION    FOUR    OF    THIS    THESIS    FOR    INTEGER    VALUES 

C      OF  IAf  IB,  AND  IC. 
C 
C 

GO    TO    (1,100) ,IENTER 
1    CONTINUE 

Z=NNN-HC 

W=NNN 

ICK=NNN-1 

Y=0.0 

IPATH=2 

SAVE=W/Z 

y  —  y_  1 

REDUCE=SAVE 

RETURN 
100    IFUCK.LE.C)    GO    TO    10C0 

GO  TO(200,300) , IPATH 
200    IPATH=2 

ICK=ICK-1 

SAVE=W/Z 

W=W-1.0 

REDUCE=SAVE 

RETURN 
30C  IPATH=1 

Y=Y+1.0 

ICK=ICK-1 

REDUCE=Y/Z 

RETURN 
1000    REDUCE=1.0 
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RFTURN 
END 

SUBROUTINE  GMPRD < A, B,R, N, M,L ) 

C  THIS  SUBROUTINE  COMPUTES  THE  MATRIX  PROCUCT  OF 

C      A*B  AND  STORES  THE  RESULTS  IN  R. 

OIMENSION  A(l)  ,B<1), R(l) 

IP=0 

IK=-M 

DO  10  K=1,L 

IK=IK*M 

DO  10  J=1,N 
IR=IR+1 
JI=J-N 
IB  =  IK 
R(IR)=0.0 
DO  10  1=1, M 
JI=JI+N 
IB=IP+1 
10  R(IR)=P( IP)*A< JI)*B(IB> 
RETURN 
END 

SUBROUTINE  DURMAT < CUR , JCUR ) 
C 

C      THIS  SUBROUTINE  PROVIDES  THE  SCALE  FACTOR  CF 
C      P(IAtlBtlC)  OF  SECTION  FOUR  IN  THE  FOLLOWING  ORDER, 
C      N/N,1/N,(N-1)/N,2/N,... 


C 


DIMENSION    DURU) 

NNN=JDUP*JCUR 

DO    10    1=1, NNN 
10    DUP( I)=O.C 

IJ=JDUR 

DO    200    I=2,JDUR 

IJ=IJ+I-1 

DUR( IJ)=1.C 

IJ=IJ+1 

DUR< IJ)=1.0 

IF( IJ.GE.NNN)    GC    TO    3C0 

NN=H-1 

Z  =  2.0 

DO    100    IK=NN,JDUR 

IJ=IJ+1 

DUR( IJ)=DUR( IJ-1)*(1.0/Z) 
100  Z=Z*1.C 
200  CONTINUE 
300  RETURN 

END 
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